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Abstract 

We introduce and investigate a new model of a finite number of particles jumping forward on the real 
£N) . line. The jump lengths are independent of everything, but the jump rate of each particle depends on the 

relative position of the particle compared to the center of mass of the system. The rates are higher for those 
left behind, and lower for those ahead of the center of mass, providing an attractive interaction keeping the 
particles together. We prove that in the fluid limit, as the number of particles goes to infinity, the evolution 
of the system is described by a mean field equation that exhibits traveling wave solutions. A connection to 
extreme value statistics is also provided. 

Keywords: Competing particles, center of mass, mean field evolution, traveling wave, fluid limit, extreme 
value statistics 
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1 Introduction 



Interacting particle systems always show interesting behavior and present a challenge for rigorous treatment. 
In this paper we introduce and investigate a new type of model which — to the best of our knowledge — has not 
emerged in the literature before. The system consists of a finite number of particles moving forward on the 
real line IL The center of mass of these particles is computed and, given a configuration, each particle jumps 
after an exponential waiting time that depends on its relative position compared to the center of mass. The 
more a particle lags behind the center of mass, the less its mean waiting time is; on the other hand, the more 
a particle is ahead of the center of mass, the more its mean waiting time is. When the exponential time has 
passed, the particle jumps forward a random distance, independent of everything else, drawn from a probability 
distribution concentrated on the non-negative half-line. While doing so, it naturally moves the center of mass 
t-H ■ forward by 1/n times its jump length as well, where n is the number of particles. Thus the dynamics has an 
effect of keeping the particles together, while randomness creates fluctuations in the system. 

Several real-life phenomena can be modeled this way: the evolution of prices set by agents trying to sell the 
same product in a market, the position of goats in a herd, etc. Mathematical finance is an area of active research, 
and many processes arising in finance are modeled with interacting particle systems, see e.g. the monograph on 
stochastic portfolio theory by Fernholz [13] . Banner, Fernholz and Karatzas [5J, and the survey by Fernholz and 
Karatzas |16j . Herding behavior has drawn serious interest in the physics literature, a few examples are Vicsek, 
Czirok, Ben- Jacob, Cohen and Shochet [32], Czirok, Barabasi and Vicsek [TTj . Ballerini, Cabibbo, Candelier, 
Cavagna, Cisbani, Giardina, Lecomte, Orlandi, Parisi, Procaccini, Viale and Zdravkovic [3]. 

The problem of competing individuals is interesting even without dependence of the dynamics on the actual 
position of the particles. A series of papers have been written on models where the competing particles evolve 
according to i.i.d. increments: Ruzmaikina and Aizenman (27) . Arguin [I], Arguin and Aizenman [2], Shkol- 
nikov |28j . The main point of investigation in these papers is characterizing point processes for which the joint 
distribution of the gaps between the particles is invariant under the evolution. 

Several authors have considered continuous models in which the motion (drift) of the particles depends on 
their rank in the spatial ordering; see e.g. Banner, Fernholz and Karatzas [5J, Pal and Pitman [25], Chatterjee 
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and Pal [5], Shkolnikov [5^1 EH 120] ■ These models are central objects of study in modeling capital distributions 
in equity markets. Branching diffusion-type models have also been investigated with a linear interaction be- 
tween particles, see e.g. Greven and Hollander Q15], Englander [T^] and the references therein. Linearity of the 
interaction is an important ingredient in these papers making the analysis somewhat easier. 

Closer to the present work are jump processes with interaction between the jumping particles. A related 
model is introduced in ben-Avraham, Majumdar and Redner [6], where particles compete with a leader who 
looks back only at the contestant directly behind him. Interaction between the leader and the followers is realized 
via a linear dependence of the mean jump length on the relative position of the leader and the followers. A 
process with rank-dependent jump rates is investigated in Greenberg, Malyshev and Popov [18 , and one with 
similar, but randomized jump rates in Manita and Shcherbakov [53]. Maybe the closest model to our system is 
the one featured in Grigorescu and Kang |20j . where particles jump in a configuration-dependent manner, and 
the authors give special focus to the situation where the dependence of particle advances on the configuration 
is via the center of mass of the particles. 

In our case, positive jump lengths are independently chosen, and the jump rate of a particle depends on 
its position relative to the center of mass. This jump rate function is non-increasing and non-negative, hence 
clearly cannot be linear in nontrivial cases. The setting is thus genuinely different from situations in the above 
examples. The non-increasing jump rate function creates a monotonicity in the system which keeps the particles 
together. This gives rise to a traveling wave-type stationary distribution. We address the following questions. 

• The two-particle case is, of course, tractable. We compute the stationary distribution of the gap between 
the two particles for several choices of the jump length distribution. However, we cannot handle the 
situation even for three particles. (The process is not reversible.) 

• As the number of particles goes to infinity, we expect that the motion of the center of mass converges to 
a deterministic one. This motion, together with the distribution of the particles, satisfy a deterministic 
integro-differential equation in this so-called fluid limit. We call the resulting equation the mean field 
equation. We write up this equation by heuristic arguments and, in some cases, we solve it for a traveling 
wave form, i.e. the stationary solution as seen from the center of mass. Even finding the traveling wave 
solution to this mean field equation is nontrivial. 

The reason for expecting a deterministic limiting equation is the averaging property of the dynamics. 
We in fact prove this phenomena in this paper later on for some cases. As the motion of the center of 
mass becomes deterministic, the particles decouple, each independently following the deterministically 
moving center of mass. Therefore in the limit it does not matter if we consider a deterministic cloud of 
particles (the limit coming from the empirical distribution of n — > oo particles), or a single particle in the 
corresponding distribution that is driven by the deterministic center of mass. The evolution of the fluid 
limit can therefore be called a mean field dynamics. Of course, the motion of the center of mass must be 
chosen appropriately to be consistent with the cloud or distribution of particles. In the traveling wave 
case, the distribution, as seen from the center of mass, becomes stationary, and the speed of the center of 
mass becomes constant in time. 

• Some traveling wave solutions lead us to Gumbel distributions, which naturally pose the question whether 
an extreme value theory interpretation can be given to the process. An idea by Attila Rakos shows that 
there is indeed such a connection. 

• Under restrictive assumptions on the jump rates, we prove that the evolution of the process in the fluid 
limit indeed satisfies the deterministic integro-differential equation we formulated by heuristic arguments. 
We use weak convergence methods and uniqueness results on the integro-differential equation. Since the 
center of mass is not a bounded function of the particle positions, standard weak convergence techniques 
need to be extended to our situation. 

This last point is the most interesting from a mathematical point of view. Our proof is a standard fluid limit 
argument, somewhat similar to those in Manita and Shcherbakov 24] and Grigorescu and Kang [20]: we use 
an extended notion of weak convergence, tightness, and martingales to show convergence to the unique solution 
of the limiting integro-differential equation. As opposed to Greenberg, Malyshev and Popov [IB] , asymptotic 
independence of the particles (as the number of particles tends to infinity) will be a consequence of the proof, 
rather than a method to rely on. This is because our case of dependence through the center of mass does not 
seem to provide an easy a priori way of showing asymptotic independence of the particles. 

We start by precisely describing the model in Section [5] We formulate our main results in Section [3] first, 
we formulate the limiting mean field equation together with some traveling wave solutions, and we also state 
the fluid limit theorem. These are the results we believe to be the most important, and additional statements 
can be found in subsequent sections. We end this section with a collection of open questions. We proceed by 
analyzing the two-particle case in Section[4] The mean field equation is investigated in Section [5] In particular, 
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a connection with extreme value statistics is heuristically explained in Section 15.21 Finally, Section [5] contains 
our fluid limit argument under some restrictive conditions on the jump rates. 

The reader may find an extended version of this paper in [3] , where we provide more details on arguments 
that are not essential for our main results, particularly in Sections S] and [5j In addition, see [3J Section 7] 
for simulation results which support the idea that the mean field equation holds in the fluid limit in greater 
generality. 

2 The model 

To model the phenomena described in the Introduction, we present the following model. The model consists 
of n particles on the real line. Let us denote the position of the particles by x\ (t) , x% (t) , . . . , x n (t), where t 
represents time and (t) £ R for i = 1, . . . , n. We now describe the dynamics. Given a configuration, the mean 
position (center of mass) of the particles is m n (t) = ^ X)"=i x i (*)• Now let us denote by w : R — > R + the jump 
rate function, a positive and monotone decreasing function. The dynamics is a continuous-time Markov jump 
process. The i th particle, positioned at xi (t) at time t, jumps with rate w (a;, (t) — m n (t)). That is, conditioned 
on the configuration of the particles, jumps happen independently after an exponentially distributed time, with 
the parameter of the i th particle being w (xi (t) — m n (i)). 

When a particle jumps, the length of the jump is a random positive number Z from a specific distribution. 
This is independent of time and the position of the particle and all other particles as well. We scale the jump 
length so that EZ — 1, and for technical reasons we assume that Z has a finite third moment. If this distribution 
is absolutely continuous then its density is denoted by (p. 

Our focus is on the empirical measure of the n particle process at time t: 

I " 

i=i 

More generally, we are interested in the path of the empirical measure, (i n (•) = (fi n (t)) t>0 , with initial condition 

Mn (0) = ~Ei=l<5* 4 (0)- 

We introduce the following notation in order to abbreviate formulas, which is also well suited for weak 
convergence type arguments. For a function / : R — > R and a probability measure P on R let 

oo 

(f,P) := J f(x)P(dx). 

— oo 

We also use the following notation: (/ (x) , P) = (/, P). Using this notation, we have m n (t) = (x, /i n (t)}. 

Now we can present the generator of the process acting on the integrated test functions as above. Let Ct, 
denote the space of bounded and continuous functions from R to R, and let Id denote the identity from R to 
R: Id 0) = x. Define 

(2.1) H:={feC b : |/| < 1} U {Id} . 

This set of functions will be important later on. For a function / € H we have: 

L </, Mn (*)) = ((E (/ (x + Z))-f (x)) w(x~ m n (t)) , M „ (t)> , 

where L is the generator of the process. The particular choice of H is so that we can apply the generator L to 
the center of mass m n (t). 

3 Results 

Our main contributions and results are (1) introducing the model, (2) formulating via heuristics the limiting 
behavior as the number of particles goes to infinity and analyzing this limiting behavior, and (3) showing 
rigorously that (under some assumptions on the jump rates) the limiting behavior of the process in the fluid 
limit indeed satisfies the deterministic integro-differential equation that we formulated. This section describes 
in more detail results describing (2) and (3). 

The case of a fixed number n of particles is difficult to treat. Of course the n = 2 case is doable: in Section[H 
we look at two specific cases of the model, and in both we compute the stationary distribution of the gap 
between the two particles. At the end of the section we illustrate the difficulties that arise for more than two 
particles in Figure [5J 
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Our main interest is in the large n limit (so called fluid limit) of the particle system. Let us assume that 
the jump length distribution has a density tp. Heuristically, in the fluid limit the empirical measure fx n (t) of 
the particles at time t becomes an absolutely continuous measure with density g{-,t) whose time evolution is 
given by the so-called mean field equation 

*«(«,*) , ,« , , f , , w ^ 

(3.1) — — — =-w{x-m{t))Q(x,t)+ / w{y-m(t))Q(y,t)<p{x-y)dy, 

— oo 

where 

oo 

(3.2) m(t)= J xg{x,t)dx 

— oo 

is the mean of the probability density g {•,£). A priori we assume nothing of the density g{-,t) other than it 
having a finite mean. The first term on the right hand side of (|3.ip accounts for the particles hopping forward 
from position x at time t: at time t there is a density of g(x,t) particles at x, who jump forward with rate 
w (x — m (t)). The second, integral term on the right hand side of (|3.1[) accounts for the particles who are at 
position y < x jumping forward at time t to position x: at time t there is a density of g {y, t) particles at y, who 
jump forward with rate w {y — m (i)), and need to jump forward exactly x — y to land at x, the probability of 
this being proportional to (f(x — y). 

Our first results concern solutions to the mean field equation (|3.1j) in the form of a traveling wave 

(3.3) g{x,t) — p(x — ct), 

where c is the constant velocity of the wave and p is centered, i.e. J xp (x) dx — 0. Traveling waves of this form 
represent stationary distributions as seen from the center of mass, which travels at a constant speed c. Substi- 
tuting (|3.3[) into the mean field equation ()3.1[) . and then writing x instead of x — ct for simplicity everywhere, 
we arrive at the following equation: 

X 

(3.4) -cp'(x) = -w{x)p{x)+ J w{y)p{y)ip (x- y)dy. 

— oo 

Recall that the jump rate function w is a non-increasing function; any non-constant w gives a strong enough 
attraction to keep the particles together, i.e. there is a probability density traveling wave solution p. This is 
formulated in the following theorem for exponentially distributed jump lengths. 

Theorem 3.1. Consider the specific case when the jump length distribution is exponential with mean one, and 
assume that w is non- constant. Let Cq be the unique solution of 

OO x . . 

(3.5) / xe° Kco ' dx = 0. 

— oo 

Equation (|3.4I) has a centered probability density solution p if and only if c = cq. In this case it is given by 

f(l w (s)-l)ds 

(3.6) p(x) = Ke« yc , 

where K is an appropriate normalizing constant. It is immediate that the probability density p of (|3.6I) has at 
least exponentially decreasing tails. 

There are several specific jump rate functions w for which the resulting stationary distribution is of interest. 
In particular, the case of w (x) — e~@ x , where f} > 0, leads to the generalized Gumbel distribution: 

Corollary 3.2. If w (x) = e~P x , where j3 > 0, then from Theorem \3.1\ we obtain for the stationary density 



(3.7) pp (x) = T r e 

where ip is the digamma function: 



T(x)- 
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Specifically for f3 = 1 the stationary distribution is the centered standard Gumbel distribution. The velocity of 
the wave is 

ft 

The Gumbel distribution often arises in extreme value theory (Gumbel [21]), e.g. this is the limiting distribu- 
tion of the rescaled maximum of a large number of independent identically distributed random variables drawn 
from a distribution that has no upper bound and decays faster than any power law at infinity (e.g. exponential 
or Gaussian distributions). More generally, the kth largest value, once rescaled, follows the generalized Gumbel 
distribution (Bertin [7], Clusel and Bertin [10], Gumbel [21]). Therefore the question arises whether there is an 
underlying extremal process in the dynamics. It turns out that there is a connection between the mean field 
model and extreme value theory: this is discussed in Section 15721 

The validity of the mean field equation is strongly supported by simulation results for various special cases 
of the model, see [3] Section 7]. We expect this to be true in quite some generality (for fairly general jump rate 
functions and jump length distributions). 

The main result of the paper is to rigorously validate this approximation. That is, our goal is to show that in 
the fluid limit — when the number of particles tends to infinity — the mean field equation (|3.ip holds. However, 
when trying to rigorously prove this, technical difficulties may arise. Therefore we impose some restrictions on 
the jump rate function and the jump length distribution. Our main restriction is that we suppose that w is 
bounded: w (x) < a. We also impose some additional (smoothness) restrictions on w, but boundedness is the 
main restriction. Our restrictions on the jump length distribution are light: we assume that it has a finite third 
moment. 

In order to precisely state our main theorem, some preparations are needed. First notice that the mean 
field equation (|3.1[) can be rewritten as the evolution of a probability measure fj, (•) using test functions from H. 
Moreover, we do not have to assume that the jump length distribution has a density either. Define 

t 

At j {n (•)) : = (*)) - (/, H (0)> - / <(£ (/ (x + Z))-f (x))w (x - m (s)) , M (s)) ds 

(3.8) 



n 

/ 



(/,M(*)>-</,M(0)>- / L(f,n(s))ds 



o 



where 



m(t) = (x,(J,(t)) . 
We say that fj, (•) satisfies the mean field equation if 

(3.9) A tJ (»(■)) = 

for every t > and / G H . 

The position of the center of mass of the particles plays an important role in our model, and therefore we 
always assume it exists (and is finite). Consequently, it is natural to look at fj, n (t) as an element of V\ (R), the 
space of probability measures on R having a finite first (absolute) moment. In order to talk about convergence 
in V\ (R) we need to define a metric on V\ (R). We use the so-called 1-Wasserstein metric. For //, v G V\ (R) 
define the set of coupling measures 

n (/x, v) := {tt G Mi (R x R) : tt (• X R) = fi, tt (R X •) = v) , 

where Mi (R x R) denotes the space of probability measures on R x R. The 1-Wasserstein metric on the space 
V\ (R) is defined as 




(3.10) d\ (fi, v) := inf < / \x — y\ tt (dx, dy) : n G II (/x, v) 



for /j,, v G V\ (R). Much is known about the metric space (Pi (R) , di); we will state what we need later. 

The natural space in which to consider /x„ (■) is the Skorohod space D ([0, oo), V\ (R)): we know that (j, n (•) is 
a piece- wise constant function in V\ (R), having jumps whenever a particle jumps. Similarly, for any test function 
/ (for instance / G H), the natural space in which to consider (/, fi n (•)) is the Skorohod space D ([0, oo), R). 

Let S be a metric space and let P n and P be probability measures on 5*. We say that P n converges weakly 
to P, denoted by P n =>„ P, if J s fdP n — > J s fdP for every bounded, continuous real function / on S. If X n 
and X are random elements of S, with X n distributed according to P n and X distributed according to P, then 
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we say that X n converges weakly to X in S, denoted by X n =>■„ X, if P n =>■„ P. For more on weak convergence, 
see |S]. 

In order to prove weak convergence of {/i n ( - )} n >i m t ne appropriate Skorohod space, we have to make the 
following assumptions on the initial distribution of the particles. 
Assumption 1. Our first assumption is that 

(3.11) lim limsupP f - V \x t (0)1 > L ) =0. 

L^oo n ->-oo \n ^ I 

Assumption 2. Our second assumption is that there exists a constant c such that for every e > 

supP ( ^Xl 1 W > & - £ / 4 ) - £ / 3 - 



n ■ 

i=l 



(Note: the exponent 2/3 has no special significance. This is simply because we assume E (-Z 3 ) < oo for simplicity. 
We could assume E (Z 2+e ) < oo for some small e > 0, in which case we could replace the exponent 2/3 with an 
exponent of 2/ (2 + e) in Assumption 2.) 

A sufficient and simple condition for both Assumption 1 and 2 to hold is that 



supE (^J2\x t (0)| 3 ^j < oo. 



Assumption 3. Our third assumption is that 

(3.12) ti n {Q)^ n u 

in V\ (M), where v £ V\ (M) is a deterministic initial profile. 
We are now ready to state our main theorem. 

Theorem 3.3. Suppose that the non-increasing jump rate function w is bounded, and that it is differentiable 
with bounded derivative. Suppose the jump length Z is scaled such that EZ = 1 and that it has a finite third 
moment. Suppose that Assumptions 1, 2 and 3 from above hold. Then 

(3.13) M')=>nM(-) 

in D ([0, oo), V\ (R)), where fi (•) is the unique deterministic solution to the mean field equation (|3.9p with initial 
condition v. 

The proof of this theorem is in Section [6] The argument follows the standard method for proving weak 
convergence results. Namely, based on Prohorov's theorem, we need to do three things: 

• Prove tightness of {/i„ ( - )} n >i m tne Skorohod space D ([0, oo), V\ (K))- 

• Identify the limit of {/i n (')}n>i : here we essentially have to show that the limit solves the deterministic 
mean field equation. 

• Prove uniqueness of the solution to the mean field equation with a given initial condition. 



3.1 Open questions 

We end this section with some open questions. 

• Theorem 13.31 is a law of large numbers type result, so it is natural to ask about the fluctuations around 
this first order limit. In particular, how does the variance of the empirical mean m n (t) scale with n and 
t? Preliminary simulations suggest that this variance grows linearly in t, and for fixed t decays in n as 
n~ a for some a > 0. (The simulations are inconclusive of what the exponent a should be.) 

• Once the order of fluctuations of the empirical mean m n (t) is known, can we determine the limiting 
distribution of m n (t) with appropriate scaling? 

• Going back to the finite particle system: can we really not deal with a fixed number (n > 3) of particles 
and determine the stationary distribution viewed from the center of mass in this case? 

• In Theorem 13.31 we make assumptions on the jump rate function w, the jump length distribution, and the 
initial distribution of the particles. Most of the assumptions are fairly weak, except for the assumption 
that w is bounded, which is the main restrictive assumption. We believe that these assumptions can be 
weakened, and in particular we are interested in removing the assumption that w is bounded. Currently 
all three steps of the proof (tightness, identifying the limit, uniqueness of the limit) use estimates which 
involve the bound on w. One crucial thing that would be needed is control over the probability of a 
particle "falling behind" by a lot. Can one prove the same result for unbounded w, e.g. w (x) = e~ x ? 
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4 Two-particle system 



As an introduction to the model, we look at the simplest case: when n = 2, i.e. when there are two competing 
particles. In particular, we look at two specific cases of the model, and in each case we compute the stationary 
distribution of the gap between the two particles. At the end of the section we illustrate the difficulties that 
arise in the case of n > 3 particles. 

4.1 Deterministic jumps 

First, let us look at the model when the length of the jumps is not random but a deterministic positive number. 
We always have the freedom to scale the system so that this length is equal to unity. So, in this subsection, we 
look at the model with deterministic jumps of length 1. 

In the case of two particles, x\ (t) and xi (i), we are interested in how far away they move from each other, 
thus in the gap g (t) — \x\ (t) — £ 2 (t)\- For simplicity let us suppose that X\ (0) — x-i (0) £ Z, so that g (t) £ N 
for all t > 0. In this case g (t) is a continuous-time Markov process on N, with infinitesimal generator Q given 
by the model in the following way. Since the gap can only change by 1, Q is a tridiagonal matrix. If the gap is 
0, i.e. x\ (t) — X2 (t), then both particles jump with rate w (0) and whichever one does, the gap changes to 1. 
Therefore the rate at which the gap changes from to 1 is 2w (0) and thus 

Qo,o = -2w (0) , Q„,i = 2w (0) . 

If the gap between the particles is k > 0, then the one in the lead jumps with rate w (-|), while the one which 
is behind jumps with rate w (—5). This means that the gap increases by 1 with rate w (§) and decreases by 1 
with rate w (— |) and so 

Qk,k-i =w (~) , Qk,k = - ( w ( ~ ) + w ( ^ ) ) , Q k ,k 1 



2/' " ' V V 2/ \ 2 J J "" V " VTX ^2 

When the jump rate function w is a monotone decreasing, non-constant function, then g (t) is a positive 
recurrent birth-and-death process with stationary distribution n = (t:q , tti , 7T2, . . . ) given by 

1 

7T0 = 



7T fc = 27r n*=o lw (g) forfc >i. 

In particular, if the jump rate function is exponential, i.e. w (x) — e~$ x , where j3 > 0, then the stationary 
distribution shows Gaussian decay, while if the jump rate function is a step function, 



a if x <C 0, 
b if < x. 



w (x) 

where a > b > 0, then the stationary distribution shows exponential decay; see [21 Section 4.1] 



4.2 Jump length distribution having a density 

Consider again the case of two particles, but now let us suppose that the distribution of the length of the 
jumps has a density, and denote this by (p as mentioned in Section [5J As before, we are interested in the gap 
g (t) — \xi (t) ~ x 2 (t)| between the two particles. Denote by p (g, t) the probability density of the gap g at time 
t, which is concentrated on [0, 00). The time evolution of the gap is completely described by the time evolution 
of p, which is described by the master equation 

s 

P (.9) =~P (.9) (w (-.9/2) + w (5/2)) + J p{y)w (y/2) tp (g - y) dy 



00 00 
+ J p(v)w(-y/2)<p(y - g)dy + J p(y)w(-y/2)ip(y + g)dy, 
g 

where the overdot denotes time derivative. The first term on the right hand side accounts for the case when 
the gap length is g and one of the two particles jumps (Fig. QJi andHJ)). The second term accounts for the 
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(a) 
(b) 




Figure 1: Events that change the gap length. 

case when a gap of length g is created from a previous gap of length y < g due to the leader jumping forward 
(Fig. [TJ;). The third term accounts for the case when a gap of length g is created from a previous gap of length 
y > g due to the laggard jumping forward, but still remaining the laggard (Fig. [TJi). The fourth term accounts 
for the case when a gap of length g is created from a previous gap of length y > due to the laggard jumping 
forward and overtaking the leader (Fig. [T£). It is easy to check that this conserves the total probability, i.e. 

f™p(g)dg = 0. 

Our goal is to compute the stationary density, i.e. p such that p (g) = 0. It is unlikely to be able to find a 
closed formula for the stationary density in full generality (for general ip and w), and so we restrict ourselves to 
the special case when (p(x) = e~ x l[x > 0] and w (x) = e~P x , where > 0. 

In this case it can be shown (see [3J Section 4.2] for a detailed calculation) that the stationary density of the 
gap is 

(cosh(f)) 

with c being the normalizing constant. In particular, in the special case of /3 = 2, the stationary density is 



'(<?) 



i 



cosh (g) 



for g > 0. 



4.3 Many-particle case 



In the n > 3 case with deterministic jumps, the particles Xi (t) , . . . , x n (t) form a continuous-time Markov 
process on an n dimensional lattice. Viewed from the mean position of the particles, this becomes a Markov 
process on an n — 1 dimensional lattice. However, even in the n = 3 case the combinatorics of the problem 
becomes too difficult to find a closed form for the stationary distribution (see Figure [5]) . The model with the 
jump length distribution having a density presents similar difficulties in the n > 3 case. (As we have seen, the 
problem is not completely solved even in the n = 2 case.) Understanding the finite particle case is the subject 
of future work. 



5 Mean field 

We now turn our attention to the mean field approximation of the model and study the solutions of the mean 
field equation (|3.1[) . First of all, using the fact that ip is a probability density function, we can check that the 
mean field equation (|3.ip conserves the total probability, i.e. dg< g t ^ dx = 0. 

An important quantity is the speed of the mean of the density. This speed can be expressed as a simple 
integral by differentiating equation (|3.2p with respect to time and using the mean field equation p. ID : 



■(*) = 



x 9 -^dx 
at 



= — I xw (x — m (t)) g (x, t) dx + / / xw (y — m (t)) g (y,t) ip (x — y) dydx 



— oo — oo 



xw (x — m (t)) g (x, t) dx + j w (y — m (t)) g (y,t) J xip(x — y)dxdy 

y 

oo 

xw (x — m (t)) g (x, t) dx + / w (y — m (t)) g (y,t) dy + / yw (y — m (t)) g (y,t) dy 



8 



Figure 2: In the n — 3 case with deterministic jumps the particles viewed from the mean position form a 
continuous-time Markov process on a 2 dimensional lattice with directed edges, as seen above. The coordinates 
in the nodes show (3 {x\ (t) — m n (t)) , 3 (#2 (i) — m n (t)) , 3 (23 (t) — m n (t))). (We multiply by 3 only so that 
we need not write fractions in the figure.) The rate of going from one node to another on a particular edge is 
indicated on the edge itself, however, once again the argument of the jump rate function w (■) is multiplied by 
3 so that we do not need to write fractions in the figure. The edges going outward from the origin are colored 
in blue, the ones which are going inward toward the origin are colored in red, while the remaining edges are 
colored green. Since the rates on the incoming red arrows are larger than those on the outgoing blue arrows, one 
can see that there is a drift towards the origin, and therefore one expects there to be a stationary distribution 
on this lattice. However, there can be no detailed balance since there are directed cycles in this graph, and 
therefore calculating the stationary distribution becomes difficult. 
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(5.1) = / w(y-m(t))g(y,t)dy, 



where we used that xip (x) dx — 1, as assumed in the model. When m (t) — ct and g(y,t) — p(y — ct), 
equation (|5.1|) becomes 

oo 

(5.2) c= / w{x)p{x)dx. 



We now turn to proving Theorem 13. II 
Proof of Theorem \3.1\ Substituting ip(x) = e~ x l[x > 0] into (|3.4p we get 

X 

(5.3) — cp' (x) = — w (x) p (x) + / w (y) p (y) e~^ x ~ y ^ dy 



First, we take the derivative of the integral term in (|5.3I) . Using the property of the exponential function, that 
its derivative is itself, we get 

(X \ X 

J w(y)p{y)e- {x - y) dy\ =w(x)p(x)- J w (y) p (y) e - {x - y) 'dy 
-oo / — oo 

= w (x) p (x) — w (x) p (x) + cp' (x) = cp' (x) , 
where we used (15.31) in the second line. It follows that for some constant D we have 



X 

I 



w (y)p(y) e ^ v ^dy — cp(x) + D. 



Plugging this into (15 ,3|) we arrive at 

(5.4) — cp' (x) + w (x) p (x) — cp (x) + D. 

To show that D = 0, let us integrate both sides of the equation with respect to x from — oo to oo (by integrating 
from —R to R and taking the limit as R — > oo). The integral of the first term on the left hand side is since 
the total probability is conserved by the mean field equation, and the integral of the second term on the left 
hand side is c according to (|5.2[) . Since p is a density we also have that the integral of the first term on the 
right hand side is c as well, therefore 

R 

lim / Ddx = 0, 

R— >oo J 
-R 

and thus D = 0. Now rearranging (|5.4[) we get 

P 1 (x) = (^w(x) - lj p{x) , 

from which we arrive at (13.6[) . □ 
We refer to [31 Section 5] for an alternative proof of Theorem 13. II by LP. Toth. 

5.1 Specific examples 

In this subsection we look at the stationary distribution p of (|3.6[) in the case of two specific jump rate functions. 
In both cases we also determine the speed of the traveling wave from Theorem 13. II Details of calculations are 
given only in the case of Example 15.11 the calculation in the other example is quite simple from Theorem 13.11 



Example 5.1 Exponential jump rate function 

If w (x) = e~P x , where /3 > 0, then from Theorem 13. II we obtain 

pp (x) = Ke' x ~ e 1 
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where K is an appropriate normalizing constant. Integrating this over R gives us the value for K and thus the 
density becomes 



To check this let us calculate the integral: 



pp [xj dx = I — T^ e { ' ' dx 

r(4 



oo 



' f " ' 'dy =P[ z te-'(-¥\=-±rl z^e-'dz = 1, 



r (*)_{, V fiz) r(i) o 

using the substitutions y = x + los ^ c/3 ) and z — e~@ v . From Theorem 13. II the speed of the wave c is determined 
by the fact that pp is centered, so to compute c we must compute the mean of the density. Using similar 
substitutions as before we get: 



l) 



log (c/3) /3 /"log (z) i _ z ( dz\ log (c/3) 



1 

7 



'(*) 



oo 



r 

log (c/3) + 



(*) 



which is zero when 



where ip is the digamma function: 



Therefore the stationary distribution in this case is given by p.7[) . Note the exponential tail of pp on one side, 
and the double-exponential tail on the other side. Specifically, in the case of [3 = 1 the stationary distribution 
is the centered standard Gumbel distribution: 



pi ( X ) = e -(*+7)-e- { - +T) , 

where 7 is Euler's constant. Moreover, it can be seen that pp is a rescaled generalized Gumbel distribution with 
parameter 1/(3 (Bertin [TJ, Clusel and Bertin [TU], Gumbel [21] )• As mentioned in Section [3J the (generalized) 
Gumbel distribution often arises in extreme value theory, so it is natural to ask whether there is an underlying 
extremal process in the dynamics. It turns out that there is a connection between the mean field model and 
extreme value theory when k — 1//3 is an integer: this is discussed in Section HT2l 

Example 5.2 Jump rate function is a step function 
Let 

la if x < 0. 

w (x) = < 

W [b if 0<x, 

where a > b > 0. Then from Theorem 13. II the solution to (|5.3I) is 



Pa,b (x) 

where the normalizing constant is 



Ke^- 1 )* if a; < 0, 



K 



^Kei^- 1 )* if < 

(a — c) (c — 6) 
c (a — 6) 
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The speed of the wave follows from Theorem 13. II 



and thus the stationary distribution is 

(5.5) ^ = WTFf^ 

which is a Laplace distribution. 

For two further examples, see [3] Section 5.1]. 

5.2 Connection with extreme value theory 

In the previous subsection we have seen that when ip(x) = e~ x l[x > 0] and w (x) = e~@ x then the (rescaled 
generalized) Gumbel distribution arises as the stationary distribution of the mean field system. This is inter- 
esting, because at first sight there is no extremeness in the model. In this section we discuss the connection 
between the model and extreme value theory. (Here we occasionally neglect mathematical precision for the sake 
of clarity of the argument.) 

Let us start with the standard Gumbel case, i.e. @ = 1. Recall that according to our observation in the 
introduction, we can assume that the center of mass moves with a constant velocity c, and we consider a single 
mean field particle X(t) that jumps with rate e ct ~ x( ^\ so 

P {X (f) changes between t and t + dt) = e ct ~ x(t) dt + o (dt) . 

When this jump occurs, X(t) increases by a random number drawn from an exponential distribution with 
parameter 1 (denoted by Exp(l) in the sequel). 

How can we view this as an extreme process? The answer to this question was given by Attila Rakos, 
whose idea is the following. As time goes forward, let us take gradually more and more new independent and 
identically distributed Exp(l) random variables, and let us look at the distribution of the maximum of these 
random variables, which we denote by Y (t). Specifically, at time t let there be N (t) = e ct /c random variables 
altogether, so Y (t) is the maximum of A (t) i.i.d. random variables. There are A' (t) dt = e ct dt new random 
variables trying to "break the record" between t and t + dt, and thus 

/ v , s\ N'(t)dt 

P (Y (t) changes between t and t + dt) = 1 - 1 - e~ Y[t) 



= N' (t) dte~ Y ^ + o (A' (t) dte~ Y ^ 
= e ct - Y ^dt + o(e ct - Y ^dt) , 



and if Y (t) changes, then it jumps according to Exp(l) . Here we use the memoryless property of the exponential 
distribution, i.e. the fact that if T is exponentially distributed then 

P (T > s + 1 | T > s) = P (T > t) . 

Thus we can conclude that the two processes are basically the same. Furthermore, we know that Y (t) — 
log A (t) follows the standard Gumbel distribution for large t (see, for example, Gumbel [21] or the excellent 
survey by Clusel and Bertin [TO]]): 

lim P (Y (t) - log A ft) < x) = e _e ~ x , 

t— >oo 

and therefore since log A (t) = ct — log c, it follows that 

lim P (Y (t) -ct<x)= e -e- l * +1 ° sc) . 

t— >oo 

Thus by the connection we have 



lim P (X (t) -ct<x)=e 



—e 



t—too 

and since the distribution of X (t) — ct is centered, it follows that 
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where 7 = 0.5772 ... is Euler's constant. 

The idea above extends to the general case when k — 1/(3 is a fixed positive integer, with an appropriate 
definition of Y (t) . In the following we use k and 1/(3 interchangeably. 

We define Y (t) as follows. Once again, as time goes forward, let us take gradually more and more new 
independent and identically distributed Exp(l) random variables. Specifically, at time t let there be N (t) = 
e^ ct I (3c random variables altogether. Denote by Yfc (t) the k th largest value among these random variables at 
time t, and define Y (t) := kY^ it). With this definition, the rest of the calculation is done in a similar way to 
the standard Gumbel case. For a more detailed calculation, see [3J Section 5.2]. 

Thus we have rediscovered the results of the previous subsection through extreme value theory arguments. 

6 Fluid limit 

In this section we prove Theorem 13.31 The section is organized as follows. In order to present and prove our 
results, we need to introduce notation, terminology, definitions and previously known results. This necessary 
background is introduced in Section 16.11 Then in Section 16.21 we formulate as separate theorems several in- 
termediate steps that lead to the main result and we prove Theorem 13.31 using these steps. In the following 
subsections we then provide proofs of the intermediate steps stated in Section |6~2"1 

6.1 Preliminaries 

Recall that our goal is to show that — given some assumptions — the path of the empirical measure process 
{fi n (•)}„>! converges weakly in the Skorohod space D ([0, 00), Pi (R)) to fi(-), which solves the mean field 
equation (13.9[) . The goal of this subsection is to provide all the necessary preliminaries that we need during the 
proof. 

We present important results from the theory of weak convergence that we use later; in particular, we focus 
on tightness results in Skorohod spaces. For an excellent introduction to weak convergence, we refer the reader 
to Billingsley [S]. Many results that we use are taken from Ethier and Kurtz [T5|, and Jacod and Shiryaev [35] 
is also a valuable source of information concerning convergence of processes. 

As in Theorem 13.31 from now on we assume that the jump rate function w is bounded, and we denote its 
lowest upper bound by a. To put it another way: a := lim a; _j._ 00 w (x) = sup^. w (x). We also assume that w is 
differentiable and has a bounded derivative. Furthermore, in the following Z always denotes a random variable 
from the jump length distribution. As before, we assume that E (Z) = 1 and that Z has a finite third moment. 

6.1.1 Probability metrics 

Recall the definition (|3 . 10[) of the 1-Wasserstein metric. The following lemma (from Feng and Kurtz [T4l 
Appendix D]) summarizes what we need to know about the metric space (Pi (R) ,g?i). More can be found in 
e.g. Feng and Kurtz [TJJ Appendix D]. 

Lemma 6.1. (Pi(R),<ii) is a complete and separable metric space. For jit n ,/i G Pi (R), the following are 
equivalent: 

( 1) lim, woo di (fin, fi) = 0. 

(2) For all continuous functions tp onM. satisfying \ip (x)\ < C (1 + |ar|) for some C > 0, 

lim (tp, fj, n ) = (ip., . 

n— >oo 

We introduce another probability metric that we use later on. For P, Q G Pi (R) and a set H of functions 
define 

(6.1) d H (P,Q) = sup \(f,P)-(f,Q)\. 

fen 

It is immediate that du is a pseudometric on Pi (R): du (P, P) = 0, d-u (P, Q) — d-u (Q, P), and the triangle 
inequality also holds. The only thing that does not necessarily hold is that d-u (P, Q) = implies P = Q. In 
order for this to hold, T-L has to be a large enough set of functions to be able to distinguish between different 
probability measures. There are various choices of T-L that give rise to notable metrics. For instance, if T-L is the 
set of indicator functions of measurable sets, then d-u is the total variation metric. If Ti is the set of half-line 
indicator functions then d-u is the Kolmogorov (uniform) metric. The choice of Ti = {/ G Cf, : |/| < 1} gives 
the Radon metric (that this is a metric follows from e.g. Billingsley [5} Theorem 1.2.]). Consequently the choice 
Ti = H (see (|2.ip ) also yields a metric. This is what we use in the following: dn denotes the metric on Pi (R) 
given by (|6.ip with Ti = H. This metric is used in Theorem l6.14l Corollary 16 . 151 and their proofs in Section [6751 
For more on probability metrics we refer the reader to Gibbs and Su [17] for an excellent review. 
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6.1.2 Skorohod spaces 

Let (E, r) be a general metric space. Denote by C ([0, oo), E) the space of continuous E'-valued paths with the 
uniform topology. The Skorohod space D([0,oo),E) consists of E- valued paths that are right-continuous and 
have left limits. We do not go into details about the topology on D ([0, oo),E), but we note that there exists a 
metric under which D ([0, oo), E) is a complete and separable metric space if (E, r) is complete and separable 
(Ethicr and Kurtz [T31 Chapter 3]). The topology generated by this metric is called the Skorohod topology on 
D ([0, oo), E). Note that C ([0, oo), E) is a subset of D ([0, oo), E) and it can be seen that the Skorohod topology 
relativized to C ([0, oo), E) coincides with the uniform topology there. For more on the Skorohod space we refer 
the reader to Billingsley [5J Chapter 3] and Ethier and Kurtz [T31 Chapter 3]. 

The reason we introduced the Skorohod space in general is because the natural space in which to consider 
//„(■) is D ([0, oo), V\ (R)): we know that fi n (•) is a piece-wise constant function in V\ (R), having jumps 
whenever a particle jumps. (We can define fi n (■) to be right-continuous at its discontinuities.) Since V\ (R) is 
complete and separable under the Wasserstein metric, D ([0, oo),Pi (R)) is also complete and separable under 
an appropriate metric. Similarly, for any test function / (for instance / G H), the natural space in which to 
consider (/,//« (■)) is the Skorohod space D Q0,oo),R). 



6.1.3 Tightness in Skorohod spaces 

Recall our plan of action from the end of Section [3] 

• Prove tightness of {fj, n (-)} n >i m the Skorohod space D ([0,oo),7- , i (R))- 

• Identify the limit of (')}„>i : here we essentially have to show that the limit solves the deterministic 
mean field equation. 

• Prove uniqueness of the solution to the mean field equation with a given initial condition. 

In view of the first point, our goal here is to find checkable sufficient conditions for tightness in Skorohod spaces. 
Since all metric spaces we deal with are complete and separable, by Prohorov's theorem the concepts of tightness 
and relative compactness are equivalent in our setting, and therefore we use these two notions interchangeably. 
Following Perkins [26] . we say that a sequence is C-relatively compact if it is relatively compact and all weak 
limit points are a.s. continuous. 

Tightness in D ([0, oo), R) 

First, we state conditions for tightness in D ([0, oo), R). For y (•) G D ([0, oo), R) define 
(6-2) Jt(v(-))= sup \y(t)-y(t-)\, 

0<t<T 

the largest (absolute) jump in y (•) until T. Let {Y n (-)}„>i be a sequence of random variables in D ([0, oo),R). 
The following theorem, which is a combination of Billingsley [8] Theorem 16.8.] and the corollary thereafter, 
characterizes tightness in D ([0, oo), R). 

Theorem 6.2. The sequence {Y n (-)} n >i * s tight in D ([0, oo),R) if and only if the following three conditions 
hold: 

(i) 

(6.3) lim limsupP(|r„ (0)| > L) = 0. 



(ii) For each T > 0, 
(6.4) 



lim limsupP(J T (Y n (•)) > L) = 0. 



(Hi) For each T > and e > 0, 

(6.5) lim lim sup P inf max sup \Y n (t) — Y n (s)| > e I = 0, 

<5^0 n ^oo \ {tj} l<i<v aite [i 4 _ 1)t4 ) J 

where the infimum extends over all decompositions [tj— i, U), 1 < i < v, of [0, T) such that ti — ti-i > 5 for 
1 < i < v. 
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We now look at a condition which implies that any limit point is continuous, i.e. is in C ([0, oo), R). Following 
Ethier and Kurtz [T3J p. 147], for y (■) € D ([0, oo), R) define 



(6.6) J (y (•)) = / e- u min { J„ (y (•)) , 1} du, 

o 

where </«(?/(•)) is defined as in (|6.2p . The following theorem characterizes continuity of a weak limit point 
(Ethier and Kurtz [Ml Theorem 3.10.2.]). 

Theorem 6.3. Suppose Y n (•) =>„ Y (•) in D ([0, oo),R). TTien Y (•) is almost surely continuous if and only if 
J(Y n (-))^ n 0. 

We now give sufficient conditions for a sequence to be C-relatively compact. 
Theorem 6.4. Suppose the following two conditions hold. 

(i) 

(6.7) lim limsupP(|r„ (0)| > L) = 0. 



(ii) For each T > and e > 0, 



(6.8) 



lim lim sup P sup \Y n (t) - Y n (s) \ > 

<5->0 n _j.no I 0<t-s<(5 
V0<s<t<T 



Then {Y n (-)} ra >i * s C-relatively compact in D ([0, oo),R). 

This theorem is stated in Grigorescu and Kang [201 Section 3] , but without a proof, and so for completeness 
we provide a short proof. 

Proof. Let us check tightness first: according to Theorem 16.21 we need to check that (|6.3[) . (j6.4[) . and (|6.5[) 
hold. Our first condition is exactly (|6.3I) . From (|6.8I) it follows that for every T > 0, Jt (Y n (•)) 0, and 
consequently (|6 .4[) holds. To show that (|6.5I) follows from (|6.8p . note that 

inf max sup (t) - F n (s)\ < sup |F„ (t) - Y n (s)\ , 

{ti}l<t<«ate[i 1 _i,t 1 ) 0<t-s<25 

0<s<t<T 

where the infimum extends over the same range as in Theorem 16.21 

Now let us check that any weak limit point is a.s. continuous. According to Theorem 16.31 we need to check 
that J (Y n (■)) =>„ 0. From (|6.8p we know that for every T > 0, J T (Y n (•)) =>„ 0. For any fixed e > there 
exists a T such that e~ T < e/2. From (|6.6I) it follows that J (Y n (•)) < TJt (Y n (•)) + e~ T , and consequently: 

lim P (J (Y n (■)) > e) < lim P (J T (Y n (■)) > e/2T) - 0. □ 

n— >oo n— >oo 

Tightness in Skorohod spaces containing measure-valued paths 



Now let us turn to conditions that imply tightness in the Skorohod space of time-indexed measure-valued 
paths D ([0, oo), V\ (R)). Our main tool is a slight modification of a theorem of Perkins [551 Theorem II. 4.1]. 
Perkins' theorem basically states that if we can check an extra condition, then C-relative compactness in 
D ([0, oo), Mp (E)) (where Mp(E) is the space of finite measures on the metric space E, with the topol- 
ogy of weak convergence) can be checked by checking the C-relative compactness of appropriate integrals in 
£>([0,oo),R). 

Our modification of Perkins' theorem is the following. 

Theorem 6.5. A sequence of cadlag V\ (M)-valued processes {P n (•)}«>! * s C-relatively compact in 
D ([0, oo), Vi (R)) if the following conditions hold: 

(i) For every T > there exists a c — c (T) and a r = t (T) G (0, 1) such that for every e > there exists a 
K = K T ,s 6 R such that K < c/e 1 " T and 

supP( sup P n (t, (-oo, -if) U (K,oo)) > e ) < e. 

n \0<t<T J 
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(ii) For every f S Cj,> {(f,P n (*))} n >i * s C -relatively compact in D([0,oo),K). 

If one looks at the proof of Perkins' original theorem ^Zjo\ Theorem II.4.1], then one can see that it holds 
true if one replaces Mp (E) with Mi (E), the space of probability measures on E with the topology of weak 
convergence. However, our measure-valued process lives in a different space, namely fx n (•) £ D ([0, oo), V\ (M)), 
where the Wasserstein space V\ (M) is endowed with the metric d\. It turns out that we cannot simply replace 
Mp (E) with V\ (K) in the theorem. Convergence in (Pi (R) ,d±) is stronger than weak convergence: it is weak 
convergence and the convergence of the first absolute moment of the measure. It is this that causes the original 
proof of the theorem to break down. 

The proof of Theorem 16.51 is essentially the same as that of Perkins' theorem, but with a slight modification 
at one point, which requires that our condition (i) is stronger than in Perkins' original version. For completeness, 
we reproduce the whole proof, following the lines of Perkins [26j Section II. 4.]: this can be found in Section IB~6l 

We use Theorem l6.5l in order to prove tightness of the empirical measure process {/i n (•)}„> i m Corollarv l6.10l 

6.1.4 A few more facts that are used 

Before we move on to our results, let us finish the preliminaries with a few more things that we use later in the 
proofs (in particular in Section lo~4")) . We rely on the continuous mapping theorem (Billingsley [5J Section 2]). 
We do not need the theorem in full generality, only the following: 

Theorem 6.6. Suppose h is a continuous function from the metric space (S,d) to the metric space (S',d'). If 
X n =^„ X in (S, d), then h (X n ) =^„ h (X) in (S', d') . 

Finally, we need the following two lemmas, which can be found as exercises in Ethier and Kurtz [T51 
Chapter 3]. 

Lemma 6.7. Let E and F be metric spaces, and let f : E —> F be continuous. Then the mapping n/oi 
from D ([0, oo), E) to D ([0, oo), F) is continuous. 

Lemma 6.8. The function 



/(*)(•) = 




o 



from D([0,oo),R) to D ([0, oo), R) is continuous. 



6.2 Main steps of the fluid limit procedure 

In this subsection we collect the intermediate steps that build up the proof of the main theorem, Theorem 13. 31 
leaving the proofs of these results to later subsections. We complete the proof of Theorem 13.31 at the end of 
the subsection. Recall our assumptions on the jump rate function w and the jump length Z which we assume 
throughout this section. 

Our first results deal with showing the tightness of {/i n (-)} n >i m * ne Skorohod space D ([0, oo), Pi (M)). 

Theorem 6.9. Suppose Assumption 1 from Section\3\ holds. Then for any f G Cf, the sequence {(/, \i n (•))} n >i 
is C ' -relatively compact in D ([0, oo), K). 

We remark that although we only need the above result for continuous and bounded functions in the corollary 
below, continuity is not used in our proof, and with a slight modification in the proof the same can be proven 
for Lipschitz continuous functions. 

Corollary 6.10. Suppose Assumptions 1 and 2 from Section^ hold. Then {/i n (•)} n >i * s C -relatively compact 
in D ([0, oo), Pi (R)). 

Theorem 16.91 and Corollary 16.101 are proved in Section 16.31 

We remark that tightness can be proved in other ways as well, using the techniques of Ethier and Kurtz |13l 
Chapter 3]. In particular, Thomas G. Kurtz communicated to us a different way of checking tightness, and we 
have included a sketch of his proof at the end of Section 16.31 

Our next group of results is concerned with identifying the limit of the sequence {/i„ (•)}«>! • In particular, 
our goal is to show that any weak limit point of {/i n (*)}n>i s °l ves the mean field equation. We prove this in 
two steps. Recall the definition (12.11) of H . 

Theorem 6.11. For every t > and every f € H, 

(6.9) sup \A sJ (m„ 0)1 — 

0<s<t n^oo 

F 

where > denotes convergence in probability. 
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Theorem 6.12. If fj, n (•) ==>„ fi (•) in D ([0, oo),Vi (R)), then for every t > and every f G i?, 
(6-10) A tJ ([i n (-))^ n A t , f (»(■)) 

in R. 

An immediate corollary of these two theorems is the following. 

Corollary 6.13. Any weak limit point //(•) of {//„ (•)} ra >i * n ^ e Skorohod space D ([0, oo), V\ (R)) solves the 
mean field equation almost surely. That is, if fi n (•) /i (•) m I? ([0, oo), V\ (R)), i/ien /or every t > and 
every f G H , A t j (p (•)) = almost surely. 

The proofs of Theorem 16. Ill Theorem 16. 121 and Corollary 16 . 1 31 can be found in Section 

Our final result concerns the mean field equation: the solution with a given initial condition is unique. 

Theorem 6.14. Suppose pi 1 (•) and pi 2 (•) are solutions to the mean field equation with initial conditions /^J and 
fj,Q respectively. Denote by dn (t) the distance dn (m 1 W , A* 2 (t)) of the two measures at time t according to the 
metric defined by (|6.1[) with TL = H . Then there exists a constant c such that 

(6.11) d H (t) <d H (0)e ct . 

Corollary 6.15. Suppose /i 1 (•) and [i 2 (•) are solutions to the mean field equation with the same initial condi- 
tion. Then pi 1 (•) = /i 2 (•), i.e. the solution to the mean field equation with a given initial condition is unique. 

Let us now prove our main result. 

Proof of TheoremlKM The sequence {/i„ (-)} n >i is tight in D ([0, oo), V\ (R)) due to Corollary IBTU1 Corol- 
lary 16.131 shows that any limit point satisfies the mean field equation. Due to Assumption 3, any limit point 
satisfies the mean field equation with deterministic initial condition v. We know that the solution to the mean 
field equation with a given initial condition is unique fCorollary |6.15[) . Consequently, due to Prohorov's theorem, 
/^n (•) =^n M (•)> since we have proved tightness and the uniqueness of any limit point. □ 

6.3 Tightness 

In this subsection we prove Theorem 16.91 and Corollary 16.101 

Before starting the proof, let us introduce a particle system {xi (-)}™ =1 , related to {x{ which we 

use in the proof. Initially, the two particle systems have the same configurations: Xi (0) = Xi (0) for each 
i. Let {xi (•)}"=! be a particle system in which all the particles are independent from each other, all of the 
particles jump with rate a (recall that a = lim a; _ ! ._ 00 w (x) = swp x w (x)), and whenever they jump, the jump 
length is a random variable from the same jump length distribution of the original system, independently of 
everything else. In other words, let Si (•) be independent Poisson processes with rate a for each i , let £J, • • • 
be i.i.d. random variables from the jump length distribution, and define Xi (t) — Xi (0) + Ylk=i Moreover, 
{xi (OlILi an d {xi (OlILi can be constructed jointly on the same probability space in the following way. Define 
{it (•)}"=! exactly as above, and whenever a particle Xi jumps, let Xi jump the same jump length with probability 
w [xi (t) — m n (t)) /a; with the remaining probability let Xi stay in place. This joint construction leads to two 
advantageous monotonicity properties: first, 

Xi (t) > Xi (t) 

for any i and t > 0, and second, 

(6.12) Xi (t) - Xi (s) > Xi (t) - Xi (s) 
for any i and t > s > 0. 

Proof of Theorem \6.9l Let / G and denote by K the upper bound of |/|, i.e. \f (x)\ < K for all x. Define 
!n (•) := (/, Mn (')) = i\ Yn=i f ( x i ("))• Clearly \I n (t) \ < K for any t. In order to prove tightness in D ([0, oo), R) 
we need to show that (|6.7p and (I6.8[) hold with Y n replaced with /„ . Since / is bounded it is immediate that (I6.7[) 
holds. Now let us show that (16.81) holds. Our basic observation is that 



\I n (t)-I n (s)\ = - 
n 



EtfC* (t))-f(xi (s))) 



(=i 



n n 

<-Y,\f fa (t)) - / ( Xi (S))\ < — V 1 [Xi (t) > x. t (s)] 
n * — » r) * — * 



n * — ' n 

i=l i=l 
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By monotonicity of Xi (•) and using (|6.12[) we have 

sup \I n (t)-I n (s)\>e) <P sup — V 1 [xi (t) > Xl (s)] > e 

Q<t-s<S / \ 0<t-s<8 n ■ ; 

\0<s<t<T / \0<s<t<T 1-1 

/ 2K n \ 
< P sup yib(s + 25) > x, (s)l > s 

Vo< s <t n ^ y 

sup V 1 [ Xl (s + 25) > (s)l > £ . 

o<,<t n ^ y 

To abbreviate notation, define Y n j (s) := — Y^i=i 1 [^i ( s + ^) > ^« ( s )]- Now if sup 0<s<T Y n ^s (s) > e, then 
there exists an integer k, < k < \T/S] — 1, such that Y n j (k5) > e/4. Suppose the contrary, that for every k 
the above expression is less than e/4; then since an interval of length 25 can intersect at most three neighboring 
intervals of length 5, it follows that sup 0<s<T Y n ^s (s) < 3e/4. Therefore 



sup Y ni2S (s) > e < P (3k, < fc < [T/<5] - 1 : Y nfi (k5) > e/4) 

0<s<T / 

= 1 - P (Vfc, < k < \T/5] - 1 : Y ni5 (k5) < e/4) 

= l-(P(y n , 4 (0) <e/4)) rT/51 

= l-(l-P(F„ il5 (0) >e/4)) rT/51 , 

where we used that for every i, Xi (•) has independent and stationary increments. We bound the probability in 
the last line using Markov's inequality: 

P ^ 1 % (*) > ^ (0)] > ^) < E 1 (5) > xt (0)]) j /^, 

where 

E ^ ^ 1 [£i {5) > it {0)]j j = n¥ ( Xl (5) > x x (0)) + (n 2 - n) P (£i ((5) > f i (0) and x 2 (5) > x 2 (0)) 

= n (I - e- aS ) + (n 2 - n) (l- e - a5 ) 2 
pa rta<5 + (rt 2 — ft) (a<5) 2 , 

from which (|6.8p follows immediately. □ 

Proof of Corollary 1 6. 1 0\ In order to prove this corollary we need to check the two conditions of Theorem l6.5l The 
second condition follows immediately from Theorem l6.9l It remains to show the first condition of Theorem l6.5l 
Due to monotonicity of the Xi (t) in t, we have 

sup fx n (t, (— oo, -if) U (if, oo)) < sup //„ (t, (-oo, — if)) + sup //„ (t, (if, oo)) 

0<t<T 0<t<T 0<t<T 

= /i n (0, (-co, -if)) + /i„ (T, (if, oo)) , 

and consequently 



supP sup \x n (t, (-co, -if) U (if, oo)) > e 

n \0<t<T 

< supP (//„ (0, (-oo, -if)) + ,i„ (T, (if, oo)) > e) 

n 

< supP (p n (0, (-oo, -if)) > e/2) + supP ( M „ (T, (if, oo)) > e/2) . 

n n 

Let us look at the second term: 

1 ™ 

M „(T, (if,oo)) = -^l[x ! (T)>if] 



-t n 1 n 

< - V 1 [x< (0) > if/2] + -Vl [ Xi (T) - Xl (0) > if/2] 

i=i i=i 
-i n 1 n 

< - Y 1 [a;, (0) > if/2] + - V 1 (T) - Xl (0) > if/2] 

77 Z ^ 77 Z ^ 



n ^ — ' n ■ 

i=l i=l 
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and so consequently 

su P P( M „ (T, (K,oo)) > e/2) < supP(//„ (0, (Jf/2, to)) > e/4) 



supP ^ £ 1 ( T ) " ^ (°) > K l 2 ] > £ l^j 



Let us look at the second term on the right hand side. Using Markov's inequality and the fact that the iii's are 
identically distributed, we have 

P [ - V 1 [£i (T) - Xi (0) > K/2] > e/4 ) < -P (x i (T) - ii (0) > K/2) . 

Now using Markov's inequality again, we get that 

-P ( Xl (T) - x x (0) > K/2) = -F ((fcx (T) - (0)) 3 > K 3 /&) < -||e ((% (T) - 5* (0)) 3 ) =: 
Now putting everything together we get 



supP sup n n (t, (-oo, -K)u(K,oo)) >s <supP(/x n (0, (-oo, -K)) > e/2) 

n \0<t<T J n 

c(T) 

+ supP (fi n (0, (K/2, oo)) > e/4) + 

n A e 

and we need the left hand side to be at most e, for which it is enough that the right hand side is at most e. 
So let T > and e > be fixed, and let us choose c = c (T), r = r (T), and if = -Kx,e as follows: 

c(T) := max {3c (T) ,8c 3 } 

r(T) :=l/3 

K TjE :=c (T) 1/3 e- 2 / 3 , 

where c is given by Assumption 2. Then K 3 > 3c(T)e" 2 , and so c(T) / (if 3 e) < e/3. On the other hand 
K > 2ce~ 2 / 3 , and so using Assumption 2 we have 

sup P 0„ (0, (-oo, -if)) > e/2) < e/3 

n 

supP {nn (0, (if/2, oo)) > e/4) < e/3. 

n 

Putting all these together concludes the proof. □ 

As mentioned before, there are other ways to check tightness, and here we sketch Thomas G. Kurtz's proof 
(see also Kotelenez and Kurtz [531 Appendix]). First note that 

di (ji,v) < E \X - Y\ , 

where X and Y are any random variables with distributions y, and v respectively. The paths of our particles 
{xi (•)}£_! are exchangeable, and so for any Ij 7 ^™*- ^ j-stopping time r we have 

(6.13) E (dj {p^ (r) , ii n (r + «))) < E (a* (r + «) - a?i (t)) . 

For each T > 0, let S n (T) be the collection of j Tj/™^ |-stopping times r satisfying t < T. Then by (|6.13p and 

Aldous's criterion (see e.g. Ethier and Kurtz [T31 Theorem 8.6]), {y n (•)}„>! i s tight in D ([0, oo), V\ (R)) if for 
each V • 0. 

lim limsup sup E (xi (t + u) — x\ (r)) = 0, 

and also condition (a) of |13l Theorem 7.2] holds. In our case, for a bounded jump rate function w, these 
conditions can be checked just like before, in the previous way of showing tightness. 
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6.4 Identifying the limit 

In this section we prove Theorem 16.111 Theorem 16.121 and Corollary 16.131 

Proof of Theorem \6.11{ Let Tt denote the filtration generated by the n particles at time t. If X (t) is measurable 
with respect to T% then let 

LX ft) := lim -|-E (X (t + dt) - X (t) I FA ■ 
dt-s-o at 

t 

We know that X (t) — X (0) — J LX (s) ds is a martingale. Let t > and / <E H be fixed from now on. As in 

o 

the previous section, define I n (•) := (/, /i„ (•)) = ~ / {xj (•)). Calculation shows that 

LI n (t) = {(E (/ (a; + Z)) - / (x))w (x - m„ (t)) , /i„ (i)> , 

which means that M„ (t) := A t j ([i n (•)) i s a martingale in t with M„ (0) = 0. Therefore by Doob's inequality 
(see e.g. Jacod and Shiryaev p. 11]) we have that for any c > 0, 



< 



E I sup M„ ( s ; 
sup \M n (s)\ > c ) < K °- s -\ 

0<s<i / C z 

So in order to show (16.91) we need to show that 

(6.14) lim E(Af?(t)) =0. 

n— >oo 

We know that 

t 

N n (t) :=Ml{t)- j LM 2 n (s)ds 
o 

is a martingale with N n (0) = 0, and therefore 



4E {Ml (*)) 



(6.15) 



(M= (t)) = E / LM^ ( s ) ds = / E (LM^ (*)) d s 



The order of the expectation and the integral can be switched due to the Fubini-Tonelli theorem, because 
LMl (s) > as we will see shortly. Let us calculate LMl (t): 



LMl (t) = hm -E {Ml (t + dt) - Ml (t) | F t 



im -E ( (M„ (f + dt) - M„ (t)) 5 
;->o at \ 



lim 



■Ft , 



where we used that M n (t) is a martingale. Since LJ„ (s) is bounded, we have 

t+dt 

M n {t + dt)-M n {t) = {I n {t + dt)-I n {t))- f LI n {s)ds 



and so consequently 



LM: 



= {I n {t + dt)-I n {t)) + 0{dt) 



(t) = lim i-E f (M n (t + dt) - M„ (t)) 5 

= lim 1e ((/„ (t + dt) - /„ (t)) 2 Ji) 
dt— at v / 

= ^fcs E ((g(/(^ (*+*)) -/(**(*)))) 



i ™ 

^ J2 w {xi (t) - m n (t)) E ((/ fa (t) + Z)-f (x t (t))) 2 ) 



Now if / G C 6 , |/| < 1, then LMl (*) < 4a /«- If / = H then LM ™ (*) < a # \ z ) l n - In both cases this shows, 
via ([6~T5]) . that (prT4"|) holds. □ 
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Proof of Theorem \6.1SX Recall that w < a and that \w'\ < a' . 
We first show that for every t > and every / G H, 

(6.16) (f,[i n (t))^ n (f,[i(t)) 

in R. First of all, we know that the projection n t ■ D ([0, oo), Pi (K)) — > V\ (R), given by 7r t (i/ (•)) = (t), is 
continuous at ^ (■) if and only if v (■) is continuous at t. Since \l (■) G C ([0, oo),Pi (R)) a.s. (by Corollarv l6.10l) . 
the continuous mapping theorem and the fact that fi n (•) =^ n /i (•) in D ([0, oo), Pi (R)) together imply that for 
every t > 0, 

(*) =>n M (*) 

in V\ (R). Now for any f £ H, the function /i/ : V\ (R) — ► R, given by hf (v) = (f, v), is continuous. (This 
follows easily from the equivalence of (1) and (2) in Lemma I6.1H Using the continuous mapping theorem, this 
shows (|6.16p . 

We introduce <?/ (x) := (E (/ (x + Z)) — f (x)) in order to abbreviate notation. What remains to be shown 
is that for every t > and every / £ H, 

t t 

(6.17) J {g f (x) w(x- m n (s)) , fx n (s)) ds J (g f (x) w [x - m (s)) , \i (s)) ds 
o o 

in R. First, let us show that for any function f E H, the function hf : V\ (R) — > R, given by 
hf(v) = {gf(x)w{x — M),v), is continuous, where M = fxis(dx). So suppose limjj-j.oo di (i/ n , j/) = 0, and 
denote by M n the mean of v n , i.e. M„ = J xv n (dx). We need to show that in this case 

lim \(g f (x) w(x- M n ) , v n ) - (gt (x) w (x - M) , v)\ = 0. 

n— ^oo 

We use the triangle inequality to arrive at 

| (g f (x) w(x- M n ) , v n ) - (g/ (x) w (x - M) , v) | < | (g f (x) [w (x - M n ) - w (x - M)] ,v n )\ 

+ \(g f (x) w(x- M) , v n ) - (g f (x) w (x - M) , v) | . 

The first term on the right hand side can be bounded above by 2a' \M n — M\. This term goes to zero as 
n — > oo, due to lirrin-^oo d\ (y n , v) — and the equivalence of (1) and (2) in Lemma 16.11 For every / 6 H, 
gf (x) w {x — M) is a bounded and continuous function, and consequently the second term goes to zero as n —> oo 
as well, due to lim n _ ! . 00 d\ (y n , v) = and the equivalence of (1) and (2) in Lemma |6.1I Thus hf is continuous 
for every / G H . 

Consequently the continuous mapping theorem and Lemma 16.71 together imply that 
(9f ( x ) w(x~ m n (■)) , Hn (0) (gf (x) w(x-m (•)) , fj, (•)) 
in D ([0, oo),R). Lemma \Q. 81 then implies that 

(gf (x) w(x-m n (s)) , fi n (s)> ds =^„ J (g f (x) w (x - m (s)) , fi (s)) ds 
o o 

in D ([0, oo), R). The integral function on the right hand side is continuous, which implies that the one- 
dimensional distributions converge weakly in R, i.e. (16. 1 7[) holds. □ 

Proof of Corollary [6J3[ Fix t > and / G H. Theorem IBTTTI implies that 

A t,f {p-n (•)) 
in R. On the other hand, Theorem 16.121 implies that 

At J (Mn(0) A t j (/i(-)) 

in R. However, we know that a sequence cannot converge weakly to two different limits (Billingsley [SJ Section 
2]), and hence A t j (fi (•)) = almost surely. □ 
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6.5 Uniqueness of the solution to the mean field equation 

In this subsection we prove Theorem 16. 141 and Corollarv l6.15l 

Proof of Theorem \6.14\ We have to show that there exists a constant c such that (|6.11[) holds. In order to do 
this we prove that there exists a constant c such that 



(6.18) d H {t) <d H (P)+cJ d H (s)ds, 

o 

from which (|6.11[) follows using Gronwall's lemma. 

So let us now prove f)6.18|) . Denote by m 1 (t) and m 2 (t) the mean of \i x (t) and y 2 (t) respectively: m l (t) — 
J x[i % (t,dx) for i = 1,2. We know that /i 1 (•) and y? (•) are solutions to the mean field equation, with initial 
conditions /LtJ and fx 2 respectively, and so 

A t ,/(//(-))=0 

holds for all f £ H, for i — 1,2 (recall (|3.8p ). Using this, the definition of the metric dn , and simple inequalities, 
we have 

d H (t) = sup | (/,/** (t)>-(/,/i 2 (t))\ 
feH 

< sup |(/,A*o) - (/>Mo}| 
feH 

t 

+ f sup|((E(/( a ; + Z))-/( a :)) W (x-m 1 ( S )), M 1 ( S )) 
J feH 
o 

- ((E (/ (x + Z))-f (x)) w(x-m 2 (a)) , y 2 («)) | ds. 

The first term on the right hand side is exactly dn (0), and so what we have to show is that the supremum 
inside the integral is at most a constant multiple of dH (s). As in the proof of Theorem 16. 121 define 

g, Or) := (E (/ (x + Z)) — f (x)) 

in order to abbreviate notation. We know that gia = E(Z) = 1 and that for every / £ Cb, |/| < 1, gf is 
continuous and \gj\ < 2. First we use the triangle inequality: 

SU P |(S7 (x)w(x - m 1 (s)) (s)) - (g f (x)w(x - m 2 (s)) , /i 2 (s))| 
/e-ff 

< sup I (5/ (a;) w (x - m 1 (s)) , ^ (s)) - (.9/ (x) w (x - m 1 (s)) , (s)) I 
+ sup I (.9/ (x) w(x- m 1 (s)) , /i 2 (s)) - (5/ (x) w (x - m 2 (s)) , ^ 2 (s)) | . 

We bound the two terms separately by a constant multiple of dn (s). Let us start with the first term. If / G Cb, 
\f\ < 1, then gf (x)w (x — m 1 (s)J is continuous in x and its absolute value is at most 2a, so consequently 
the expression in the supremum is at most 2adn (s). If / = Id, then the expression in the supremum is 
K w (x — m 1 (s)) , /1 2 (s)) — (iu (x — m 1 (s)J , /i 1 (s)) | which is at most adu (s) since u; (x — m 1 (s)) is continuous 
in x and |w| < a. Thus the first term is at most 2adn (s). 

Now let us look at the second term. Using the fact that w' is bounded we have 



\w 



(x — m 1 (s)) — w (x — m 2 (s)) | < a' [m 1 (s) — m 2 (s) | < a'd// (s) , 



and since \g/\ < 2 for all / £ H , it follows that the second term is at most 2a! dn is). Therefore this shows (|6.18[) 
with c := 2a + 2a'. □ 

Proof of Corollary \6.15\ If dn (0) = then, due to Theorem 16.141 dn (t) = for all t > 0, and consequently 
fi 1 (t) = y 2 (t) for all t > 0, so /i 1 (•) = [i 2 (•). □ 

6.6 Proof of Theorem 16.51 

Before proving Theorem 16.51 let us present a result for general Polish space state spaces that we use during 
the proof. Suppose E is a Polish space and d is a complete metric on E. For x (■) E D ([0, 00), E 1 ), T > 0, and 
<5 > 0, let 

it; (x (•) , S, T) := sup d (x (i) , x (s)) . 

0<s,t<T,\s-t\<6 

(We note that this w should not be confused with our jump rate function w — the distinction will be obvious 
from the context.) Then: 
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Theorem 6.16. Suppose {X n (•)} n >i * s a sequence of cadlag E -valued processes, where E is Polish. {X n (•)}„>! 
is C -relatively compact in D ([0, oo), E) if and only if the following two conditions hold: 

(a) For every T > and every e > there exists a compact set K° = K^ e C E such that 

supP (X n (t) £ K° for some t<T) < e. 

n 

(b ) For every T > and every e > there exists a 5 > such that 

limsupP (w (X n (•) , S, T) > e) < e. 

n— >oo 

This theorem follows from Corollary 3.7.4, Remark 3.7.3, and Theorem 3.10.2 of Ethicr and Kurtz [T5] , 
Note: (a) is called the compact containment condition. 

Proof of Theorem \6.5i To prove that {P n (-)} n >i is C-relatively compact in D ([0, oo),Vi (R)) we need to show 
that conditions (a) and (b) of Theorem 16 . 1 61 follow from conditions (i) and (ii) of Theorem 16. 51 Notice that it is 
enough to demonstrate this for small e's. First, let us start with the compact containment condition. Let T > 
and e € (0, |) be fixed. To this T > 0, there exists a c = c (T) and a r — r (T) e (0, 1) given by condition (i). 
For every m > 1, let e m := ce/m 2 / T , where cr 1 = J2 m >i m ~ 2 ^ T so that Y^ m >i £m = £ - ^ ow condition (i), 
to the fixed T and e m there exists a K m := AV >£m e R such that 

A m < = m 2 /- 2 

- e)n T (&) 1 "" 



and 



supP sup P n (t, (-oo, -A m ) U (K m , oo)) > £ m ) < £,- 

n \0<t<T 



This implies that 

supP( sup P n (t, (-oo,-K m )U (K m , oo)) > l/m 2/T ) < £ m , 

ti \0<t<T / 

since e m < I/to 2 / 1 ". Now given the AT m 's, we define the following set of probability measures: 
C° := |/i <= Pi (R) : n ((-oo, -X™) U (A m , oo)) < l/m 2/r for every to > l| . 
(Of course C° = e .) The choice of the A m 's implies that 

sup P (P„ (0 £ C° for some t < T) < e, 

n 

since 

P (P„ (t) £ C° for some i < P) = P (3m > 1, 3t > T : P„ (t, (-oo, -A m ) U (A m , oo)) > 1/to 2/t 

= P f 3m > 1 : sup P„ (t, (-oo, K m ) U (if m , oo)) > 1/to 2 / t J 

\ 0<t<T J 

< V P f sup P„ (t, (-oo, -Jf TO ) U (if m , oo)) > 1/to 2 ^ 

m=l V0<t<T 7 

OO 

< ^2 e m = e. 

m— 1 

Consequently, 

supP (P„ (i) £ A for some t < T) < e, 

n 

for K° := C°. This if will be good as the compact set needed in condition (a) — what is left to show is that 
K° is compact in V\ (R). For this, it is enough to show that C° is relatively compact in V\ (R). We know (see, 
for instance, Feng and Kurtz [T3J Appendix D]) that C° C Pi (R) is relatively compact if and only if 

(6.19) lim sup / \x\fi(dx)=0. 

\x\>N 
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So let us now check (I6.19[) . Note that this is where the original form of Perkins' theorem Theorem II. 4.1] 
is not enough and we have to use our modified version, Theorem 16.51 If N > K m , then 



/oo 
|a;| /i (dx) < sup >. 



\x\ /i (dx) 

c\>N ' l=rn [-K l+1 ,-K,)U(K„K, + i] 

OO 

< sup V^ /+ iM((-oo,-/f / )U(if / ,cx))) 

l=m 

OO 

This last expression goes to as m — > oo, which shows (|6.19p . and consequently we have shown that condition 
(i) implies condition (a). 

Now let us show that conditions (i) and (ii) imply condition (b). First we show that for every / € C (Pi (R)), 
{/ ° Pn (")}n>i ^ s C-relatively compact in D ([0, oo), R). By the characterization of C-rclatively compactness 
we need to check conditions (a) and (b) of Theorem 16. 161 Due to the boundedness of /, (a) is immediate. Now 
let us show (b). Fix / £ C b (Pi (R)), T > 0, and e > 0. Choose K° = K° E as in (a). Define 

k m% 

A = {ft : Vi (R) -> R | h (fi) = J2 a H (Ai>M> , <N 6 M, /y € C&, fc, m, £ Z+ } C C fc (Pi (R)) . 

i=l j=l 

Then A is an algebra containing the constant functions and separating points in V\ (R). So by the Stone- 
Weierstrass theorem (see for instance Willard [33] ) there exists an h £ A such that 

sup \h(fi) -/(a*)| <£. 

M6-R" 

If {F„ (•)}„>! an d {Z n (0} n >i are C-relatively compact in D ([0, oo),R) then so are {aY n (•) + bZ n (•)}„>! an d 
{1^ (•) Z„ (•)}„>! for any a, 6 £ R (see the characterization of Theorem I6.16[) . Therefore condition (ii) of the 
theorem implies that {ft o P n (•)}„>! is C-relatively compact in D ([0, oo),R). Now using the characterization 
of Theorem 16.161 again, there exists a 5 > such that 

limsupP(w(ftoP n (.),<5,T) >£)<£. 

n— ¥00 

To abbreviate notation, we introduce P„ ([0,T]) := {P„ (t) : t £ [0,T)}. If s,t < T, then 

|/ (P n (t)) - / (P n (s))| = |/ (P„ (<)) - / (P n (s))\ 1 [P„ ([0, T}) C *T°] 
+ W)-/(P„( S ))|l[P„([0,T])^if°] 
< 2 sup |ft {n) - f (jm)\ + \h (P n (<)) - ft (P„ (s))| 

+ 2||/|| oo l[P„([0,T])^°] 
and so by the definition of w (/ o P„ (•) , (5, T) we have 

w (f o P„ (•) , S, T) < 2e + w (ft o P„ (•) , 6,T) + 2 H/l^ 1 [P„ ([0, T]) £ *T°] . 

Therefore 

limsu P P(w(/oP„ (),S,T) > 3e) < limsupP(P„ ([0,T]) £ X ) + limsupP (to (ft o P„ (•),£, T) > e) 

n-^oo n— foo n—>ao 

< e + e = 2e < 3e, 

by the choice of if and the fact that {h o P„ (■)}„>! is C-relatively compact in D ([0, oo),R). 

So let us recap what we have done up until now: we have shown that (i) implies the compact containment 
condition and that (i) and (ii) imply that for every f £ Cb (Pi (R)), {/ ° P n (')}„>i i s C-relatively compact in 
D ([0, oo), R). These imply that {P„ (-)} n>1 is relatively compact in D ([0, oo), R) (see Theorem 3.9.1 of Ethier 
and Kurtz [H]). What remains is to show that it is not only relatively compact, but also C-relatively compact. 

We now show that (i) and (ii) imply (b). Let us define the metric d := min{di, 1} and let 

w d (P„ (•) , 5, T) := sup d [P n (t) , P„ (s j) . 

0<s,t<T,\s-t\<6 
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Let T > and e > be fixed, and choose K° as in (a) again. Since K° is compact, we can choose [ii £ K°, 
i = 1, . . . ,M, so that if C U^P (^, e), where 

B (/i, e) = G Pi (R) : di (/i, v) < e} . 

Since e < 1, 

B (/i, e) = {2/ G Pi (R) : d (/*, z/) < e} . 

Now let fi(n) := d((H,n). Clearly / 4 £ C 6 (P x (R)). We have proved that for every / G C 6 (Pi (R)), 
{/ ° Pn (')}n>i * s C-relatively compact in D ([0, oo),R), which implies that there exists a S > such that 

M 

(6.20) 5] lim sup P (w (/< o P„ (•) ,8,T)>e)< e. 

. - n— >-oo 
1 

If /x, 1/ £ if , choose /ij so that d (i>, ^) < er. Then 

difav) < d(fj,,tij) +d(pj,v) < \d(p 7 fi 3 ) -d{iij)V)\ + 2d(fi jl u) < max|/i (/z) - /* (u)\ + 2s. 

Now let 5, t < T; the above inequality implies 

d (P„ (t) , P n («)) = d (P„ (t) , P n (a)) 1 [P„ ([0, T}) C if ] + d (P„ (t) , P„ («)) 1 [P„ ([0, T]) £ if ] 
< max |/, o P„ (t) - / 4 o P n ( a ) | + 2e + 1 [P„ ([0, T]) £ if ] , 

where we used that d < 1. Consequently 

w d (P„ (■) ,5,T) < maxw (/< o P„ (•) , 5, T) + 2e + 1 [P„ ([0, T]) £ if ] . 

Now note that {m; (P n (•) , S,T) > 3e} and {wd (P n (•) , P) > 3e} are exactly the same events, and consequently 
lim sup P (to {P n (•) ,5,T)> 3e) = lim sup P (u> d (P n (•) , 5, T) > 3e) 

<limsupP(maxw(/,oF n (-),,5,T) > e) + lim sup P (P„ ([0, T]) ^ A" ) 

n— foe \ t / n— >oo 

< lim sup V P (io (fi o P„ (■) , S, T) > e) + lim sup P (P„ ( [0, T}) £ if ) 

n— >oo . .. n— >oo 
^— 1 

< e + e = 2e < 3s, 

using (|6.20D and the choice of if . This shows condition (b), and therefore completes the proof. □ 
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